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abstract 

h— •} ' The amplitude of the angular two-point galaxy auto-correlation function w(ff) for 

, galaxies at z ~ 2 is estimated for galaxies in the Hubble Deep Field by using a 

■ U < 27 complete sub-sample. The [/-band selection ensures little contamination from 

z > 2.5 galaxies, while photometric redshifts minimise the contribution from low 

t— i . redshift galaxies. 

J> ' (i) It is confirmed that the amplitude of the correlation can be corrected for the 

ON , integral constraint (lack of large scale variance) without having to make assumptions 

ON ■ about the shape of the correlation function and by avoiding the introduction of linear 

error terms. The estimate using this technique is w(9 ps 5") = 0.10 ±0.09. Estimators 
which assume a power law of a given slope and include linear error terms would double 
this value. 

(ii) If the biases introduced in faint galaxy selection due to obscuration by large 
objects are not corrected for by masking areas around them, then the estimate would 
be w{6 re 5") = 0.16 ±0.07. 

(iii) The effective (three-dimensional) galaxy pair separation at 5" and this redshift 
q ■ range is ps 25h _1 kpc— 250h~ 1 kpc, so the correction to the spatial correlation function 
5h ' £(r) due to exclusion of overlapping galaxy dark matter haloes should be considered. 
c/3 , For clustering stable in proper units in an = 1. A = universe, our w(5") estimate 

(a) implies a present-day correlation length of tq ~ 2.6l 1 '7h~ 1 Mpc if halo overlapping 
£> . is ignored, but (b) for a present-day correlation length of ro = 5.5h _1 Mpc implies 

' that a typical halo exclusion radius is r na i = 70t3Q°h~ 1 kpc. For il = 0.1, Ao = 0.9, 

j_j ■ the corresponding values are (a) r ~ 5.8tg;gh _1 Mpc and (b) r na i < 210h~ 1 kpc (Ict 

& ' upper limit). 

(iv) The decreasing correlation period (DCP) of a high initial bias in the spatial 
correlation function is not detected at this redshift. For an = 1,A = universe 
and stable clustering in proper units, possible detections of the DCP in other work 
would imply that the values of £ at redshifts greater than z t = 1.7 ± 0.9 would be 
[(1 + z)/(l + zt)] 2 ' 1±3 6 times their values at z t , which is consistent with our lack of a 
detection at z ~ 2. 
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1 INTRODUCTION 

Structure in the Universe as represented by visible galax- 
ies is commonly represented by the two-point spatial 
auto-correlation function, This can be approximately 

parametrised as a power law 

«r) = (ro/r-F (1) 



where r are is the spatial separation of galaxy pairs, r and ro 
are expressed in comoving coordinates and 7 represents the 
approach to homogeneity at large length scales (e.g. Groth & 
Peebles 1977). For a recent review of the galaxy correlation 
function, see Peacock (1997). 

Since the major epoch of star formation in the Universe 
seems to have taken place at z ~ 1 — 2 (Madau et al. 1996), 
this is a particularly interesting period. It would be useful 
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to observationally estimate £ at this epoch in order to link 
gravitational theory with the study of luminous objects. 

Villumsen, Freudling & da Costa (1997) have already 
made an estimate of the projected, angular two-point auto- 
correlation function, w(0), (where 9 is the angle separat- 
ing two galaxies) and hence £, for galaxies in the Hubble 
Deep Field (hereafter, HDF; Williams et al. 1996), to me- 
dian redshifts up to 2 mcc j~ 1-9 from an /-band selected 
sample. Several recent estimates for £ have also been made 
for both higher redshift objects (Giavalisco et al. 1998 at 
z ~ 3; Miralles, Pello & Roukema (1999) 1998 in the HDF 
at 2.5 < 2 < 4.5) and for lower redshift galaxies (Connolly, 
Szalay & Brunner 1998 in the HDF at z < 1.6). 

An improvement on the Villumsen et al. estimate should 
be possible by using (i) a [/-band selected sample to re- 
move nearly all galaxies at redshifts z £ 2.5, and (ii) pho- 
tometric redshift redshift estimation on this sample to re- 
move most galaxies at both lower and higher redshifts than 
1.5 < z < 2.5. 

This is the option chosen here, using Mobasher & 
Mazzei's (1998) [/-selected sample of HDF galaxies, dom- 
inated by ellipticals and starburst galaxies at z ~ 2 (§2). 
Although galaxies selected in U at z ~ 2 are selected in 
rest-frame wavelengths which do not correspond to those in 
which galaxies in typical surveys are selected, the high rate 
of star formation at this epoch implies that the galaxy mix 
may not be too different from other samples. 

As in Connolly et al. (1998) and Miralles et al. (1998), 
w is estimated in a redshift range limited by photometric 
redshifts (§3) and Limber's equation [equation (7)] is used 
to consider interpretation in terms of the spatial correlation 
function £ (§3.2). 

In addition, (i) the estimate of w is made without having 
to make a prior assumption about the shape and slope of the 
correlation function (§3.1) and by avoiding introduction of 
linear terms in the correction for the integral constraint; and 
(ii) the effect of bias introduced in regions of sky obscured 
by bright objects is demonstrated (§3.1.2). 

The word 'bias' in the previous sentence is used in its 
general sense. Since the primary aim of this paper is observa- 
tional estimation, galaxy-to-matter bias (e.g. Ostriker 1993; 
Peebles et al. 1989; Matarrese et al. 1997) is not treated 
here, apart from discussion of a possible high redshift, high 
initial bias in §5.3. 

In interpreting the w estimate via integration of the rel- 
ativistic version of Limber's equation, the size of galaxy dark 
matter haloes is explicitly considered. The effective three- 
dimensional separations studied here, i.e. the median separa- 
tion of galaxy pairs which contribute to 50% of the projected 
correlation, is r^g ~ 25h~ 1 kpc— 250h~ 1 kpc (Roukema & 
Valls-Gabaud 1997). Dark matter haloes must overlap at 
these length scales and it seems likely that some modifica- 
tion of the shape of the correlation function is necessary, 
since the dominant physics in coexistence of galaxy pairs 
becomes local rather than cosmological. 

The simplest correction seems to be to introduce a 
smooth cutoff in £ below a characteristic halo radius. This 
is the approach adopted here (§3.2.3). 

The resulting estimates of w are presented in §4.1. In 
§4.2, these are interpreted in terms of the spatial correlation 
function with and without a correction for halo radii, for a 
range of simple power law hypotheses for correlation evolu- 



tion with redshift, and for a realistic range in values for the 
cosmological curvature parameters, fio and Ao. 

Comparison with and implications from other estimates 
of £, in particular regarding the decreasing correlation period 
(DCP), are provided in §5, and §6 concludes. 

The Hubble constant used here is h = Ho/ 
100km s _1 Mpc -1 . The spatial correlation function is pre- 
sented in comoving coordinates. Galaxy halo sizes are dis- 
cussed in proper units. The metric parameters fio and Ao 
are quoted when first used (Fig. 10). 



2 OBSERVATIONAL DATA SET 

The HDF is presented in detail by Williams et al. (1996). 
Selection of galaxies in the [/-band (strictly speaking, in the 
F300W band) for the three Wide Field Camera (WFC) fields 
of the HDF is described in Mobasher & Mazzei (1998), and 
is estimated to be complete to U < 27 (Uab magnitudes 
used here). 

Photometric redshifts were calculated by Mobasher & 
Mazzei (1998) using template spectral energy distributions 
(SED's) derived from stellar evolutionary population syn- 
thesis models. These extend from the far-UV to 1mm, self- 
consistently include stellar emission, internal extinction and 
re-emission by dust, and include a range of metallicities. The 
synthetic SED's were constrained both by locally observed 
galaxies of different morphological types and by the HDF 
galaxies for which spectroscopic redshifts are available. The 
r.m.s. scatter between spectroscopic and photometric red- 
shifts is estimated as 0.11. 

It should be noted that due to the lack of strong breaks 
in the optical passbands, the uncertainties in photometric 
redshifts are generally greater within the 1.5 < z < 2.5 
interval than at lower and higher redshifts. This is not a 
problem for this analysis. Since only the projected correla- 
tion function is estimated, errors within the interval have 
no effect. Scattering in the redshifts at the low redshift end 
might have some effect, since although the numbers of galax- 
ies are relatively lower, their autocorrelation is higher, than 
at high redshifts. Uncertainty at the high redshift end should 
be minimised by the [/-band selection. 



3 METHOD 

3.1 Estimation of the Angular Correlation 
Function w(9) 

3.1.1 Angular limits 

Since variance on the scale of the sample can only be cor- 
rected by information external to the sample, the estimation 
of w at angles similar to the size of either a single WFC field 
or the combined field cannot statistically represent structure 
at that projected angular scale. The presentation of angu- 
lar correlations up to 80" (Villumsen et al. 1997) or 220" 
(Connolly et al. 1998) can therefore be considered as provid- 
ing investigations of individual examples of structure, which 
could be used to select samples of individual high redshift 
clustered structures (or voids). 

Although that approach is not without interest, the goal 
here is for the study of the statistics of galaxy clustering, so 
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the upper limit in angles used for correlation estimates is 
« 40", half the size of a single WFC field. Previous expe- 
rience (Roukcma & Peterson 1994) suggests that this is a 
conservative limit above which large scale variance should 
be excluded. For the /-band limited HDF sample of Villum- 
sen et al. (1997), formal interpretation of the error bars of 
Villumsen et al.'s Fig. 1 shows that negative and positive 
offsets of w(6i) estimates at 25-80" from the power law cor- 
rected fit are mostly more significant than positive offsets 
on small scales. An alternative fit of any reasonably smooth 
function through the points would not reduce what is most 
simply interpreted as large scale variance. 

At small angular scales there is certainly no problem 
in confusion of faint sources for 6 ^>> 0.1"; these are Hub- 
ble Space Telescope data. On the contrary, the limitation is 
physical. Colley et al. (1996, 1997) have pointed out a highly 
significant excess of object pairs separated by less than 
about 1" in the HDF. This corresponds to about 6h~ 1 kpc 
in separation perpendicular to the line-of-sight (to within 
a factor of two depending on the values of the metric pa- 
rameters), in proper units. Roukema & Valls-Gabaud (1997) 
showed that if these objects were individual galaxies with a 
correlation similar to or higher than that of equation (1), 
but allowing for evolution and a wide range of acceptable 
parameters, then the measured strength of w would imply 
that their three-dimensional separations would typically be 
the same as the perpendicular separation, and a large ma- 
jority of pair separations would be less than 15-30h _1 kpc 
(proper). 

Colley et al. also point out that differential bolometric 
surface brightness dimming of (1 + z) 4 for diffuse objects 
relative to (1 + z) 2 for point sources implies that one should 
in fact expect high redshift optical imaging to split galaxies 
into clumps of objects such as H II regions. 

While a component of the sub-arcsecond clustering de- 
tected by Colley et al. may well be real galaxy cluster- 
ing [though highly biased as predicted by Ogawa, Roukema 
& Yamashita (1997), apparently detected by Steidel et al. 
(1998) and modelled by Bagla (1998)], the choice adopted 
here is to exclude this clustering. This is done (i) by setting 
a minimum angular separation of 2" down to which the cor- 
relation is estimated and (ii) considering objects separated 
by less than 0.5" as single objects. The second of these two 
limits has to be significantly smaller than the first in order 
to avoid introducing a bias. 

In order to maximise signal to noise, large bins are 
adopted as in Roukema & Peterson (1994). The range 2- 
70" is divided into six logarithmic bins, of which the first 
three are regrouped into a single bin in order to reduce the 
noise, and the largest angular bin is plotted but not used in 
for correlation estimates or integral constraint corrections. 
The range used for correlation estimates is therefore 2-39". 

3.1.2 Masking for Bright Sources 

In any image of faint galaxies, even one such as the HDF 
which was designed to avoid containing any 'bright' galax- 
ies (or stars), there are still some galaxies which are brighter 
than others. It is clear in the HDF (e.g. Fig. 8 of Williams et 
al. 1996) that there are some bright galaxies which occupy 
non-negligible fractions of the total solid angle. As is gen- 
erally pointed out in angular correlation studies (e.g. Neu- 



schaefer et al. 1991; Roukema & Peterson 1994), subtraction 
of the light from bright objects is imperfect, both due to the 
light profile only being a smooth approximation and due to 
the Poisson noise from the object adding to that of the sky. 
This implies that (a) background galaxies (or foreground 
dwarf galaxies) which ought to be detected after subtraction 
of the bright objects may be missed (due to being subtracted 
away) and (b) Poisson noise, real components of the bright 
object (e.g. a globular cluster or a knot of star formation if 
the bright object is a galaxy), cosmic rays or combinations 
of the three may be mistakenly detected as galaxies. 

Estimating the angular correlation function depends 
critically on the selection criteria being constant across the 
whole field (or for the selection function to be known pre- 
cisely enough to enable a weighting correction to be applied) 
and can be biased by either of these effects. Analyses of the 
HDF tend to be conservative in reducing the possibility of 
mistaking artifacts for galaxies, so the problem here is more 
likely to be (a) than (b). 

In either case, the correction is to apply a 'mask' to 
regions around bright objects. In this study, use of the U- 
selected catalogue would be insufficient since the photo- 
metric redshift estimates require detection in all bands and 
so would be affected by bright objects in other bands. So, 
Williams et al.'s catalogue is used to mask out circular areas 
around all objects of limiting isophotal areas of greater than 
10 sq.arcsec. These regions are excluded when distributing 
random points for use in equation (2) . 



3.1.3 Different Estimators for w 

The estimation of the angular correlation function ampli- 
tude for small solid angle, faint surveys normally suffers 
from the problems of small numbers of galaxies, of a small 
signal-to-noise ratio and of correcting for variance in the 
number density of galaxies on the scale of the sample ob- 
served (also known as the integral constraint). In this sam- 
ple, the numbers of galaxies are relatively small, compared to 
the numbers in typical magnitude/surface brightness limited 
ground-based surveys extending to faint magnitudes (e.g. 
Efstathiou et al. 1991; Neuschaefer et al. 1991; Pritchet & 
Infante 1992; Couch et al. 1993; Roukema & Peterson 1994; 
Infante & Pritchet 1995 and Brainerd et al. 1995). 

However, the small angular size of the HDF implies that 
w can be estimated at angular separations of less than 10", 
where it is expected to give a stronger signal, so that the 
signal-to-noise ratio may be sufficiently high for a significant 
estimate. 

The other problem is that of sample-scale variance 
which both (a) can introduce noise into the correlation es- 
timate and (b) needs to corrected for in order to have a 
statistic which is significant in the limit of large angles. 

(a) Provided that the amplitude of this variance is 
smaller than unity, the arguments of Hamilton (1993) on 
how the noise it introduces can be reduced to a quadratic 
term rather than a linear term become valid. Since the three 
WFC fields are best considered independently (the small in- 
crease in the number of pairs across the borders between 
does not seem to justify the possible systematic effects of 
that particular sub-sample of the total close pair distribu- 
tion), a simple, order-of-magnitude estimate of the variance 
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is provided from comparison of galaxy numbers in the three 
fields. 

Hamilton's equation (15), adapted for the angular cor- 
relation function, is then an estimator which minimises the 
noise contribution of this variance relative to other estima- 
tors such as that of Landy & Szalay (1993). Hamilton's es- 
timator, expressed in conventional terms, is 



w(0) = 



N 2 

11 gr 



- 1 



(2) 



where N gg , N rr and N gr are the numbers of galaxy-galaxy 
pairs, random point-random point pairs and galaxy-random 
point pairs respectively, separated in each case by an angle 
lying in the interval [0, 6 + AO]. Landy & Szalay's estimator, 
in the same terminology, is 

iV 00 - 2N gr + N rr 



w(0) = 



Nrr 



(3) 



The random points are points selected pseudo- randomly 
from a uniform distribution in a field of the same shape 
and size as that from which the galaxies are selected. This 
corrects for edge effects. 

(b) Large-scale variance is not included in eq. (2). The 
common technique for re-introducing large-scale variance is 
to assume that w should be a power law of a given slope, 

w(0) = A^ 1 -' (4) 

where A m is a free parameter and 7 is that of eq. (1). The un- 
corrected estimates {1 + w(0i)} have a constant term added 
(or are multiplied by a constant factor) such that a power 
law of slope 1 — 7 is obtained. 

This assumption is that adopted by nearly all authors 
of faint galaxy angular correlation function analyses who 
do not have large enough fields to avoid having to make 
this integral constraint correction. Villumsen et al. (1997) 
and Connolly et al. (1998) adopt this technique, and assume 
7 = 1.8. 

Some authors only make the assumption that w is a 
power law, and try to estimate both the amplitude and slope 
of the power law by \ 2 minimisation (e.g. Neuschaefer et al. 
1991). 

This assumption is obviously sensitive to any minor 
changes in the slope or the shape of the correlation func- 
tion over this limited angular range. As reviewed by Peacock 
(1997), it is likely that £ is not perfectly fit by a single power 
law. Either a double power law or a fitting function such as 
that of Hamilton et al. (1991) are likely to be closer to the 
intrinsic shape of £. The effective slope over a small range 
in angle may be different enough from the assumed value 
of 7 to add significant systematic error to the amplitude 
estimate. 

Moreover, this corrected formula reintroduces linear er- 
ror terms into the estimate. Effectively, the estimator be- 
comes 

2N gr + N r , 



w (0) = ^SS. 



+ C 



(5) 



iV rr 

for some constant C. 

Hamilton presents a version of this equation, [his 
eq. (24)], which optimally corrects for the large-scale vari- 



w(0) = 



N 



2n cst N gr + "est N r 



where is the mean number density estimated by some 
means external to the sample, divided by the number density 
of the sample itself. 

It is clear that varying the value of C in eq. (5) is not 
equivalent to varying n es t in eq. (6). Finding a best fit for 
w(0), subject to the constraint that it has a certain shape 
and slope, will not give the same results in the two equations. 

In addition, Hamilton suggests using large scale vari- 
ance estimated from very large scales such as the cosmic 
microwave background measurements by COBE as the ex- 
ternal constraint. However, the extrapolation over time and 
space from COBE measurements to the present work would 
obviously require numerous assumptions. 

Since in the present case the three WFC fields are anal- 
ysed independently, a simple way of estimating the n cs t in 
the present case is to use the mean density of the three fields 
as the value of n e st to use in evaluating eq. (6) for any single 
field. This should correct for variance on a scale considerably 
larger than that of interest. 

As is seen below, this results in a mean correlation 
function which is positive and consistent with a power law, 
and additionally avoids having to assume that it should be a 
power law in order to make the correction. 

In principle, there should still be a bias from variance 
on yet larger scales, but without large-scale galaxy surveys 
at z ~ 2 there is no obvious way to make this correction di- 
rectly from observational data, apart from adopting the con- 
ventional practice of assuming that w (0) should be a power 
law of a given slope. 

The uncertainty in the correlation estimates is itself es- 
timated, for any angular bin, by the dispersion in the w 
estimates between the three WFC fields. 



3.2 Relating (ton 

3.2.1 Use of Redshifts in Limber's Equation 

The angular correlation function (small angle approxima- 
tion) is given by the the double integration of £(r, 2), 



w(0,N z ) 



J dzN z {z) 2 J du£{r,z) 
[J dzN z (z)] 2 



(7) 



n^t Nr 



(6) 



where z — (z± + Z2)/2 and u = z\ — 22 parametrise the 
redshifts of two galaxies at redshifts z\ and 22, r(z, u) is the 
spatial separation of the two galaxies, and N z (z) =dN/dz is 
the redshift distribution of the sample studied (Limber 1953; 
Phillips et al. 1978; Peebles 1980; Efstathiou et al. 1991). 

In the present case, Connolly et al.'s (1998) method of 
modelling a redshift distribution by summing the individ- 
ual photometric redshifts, represented as Gaussian distribu- 
tions centred on the estimated values, where a z = 0.11, is 
adopted. 



3.2.2 Low Redshift Contribution to Limber's Equation 

However, as remarked upon by Roukema & Valls-Gabaud 
(1997), the physical scales of the function £ which contribute 
to the angular signal need to be carefully considered. 

In principle, the combination of very short length scales 
and very high correlations in the very low z component of 
the cone of observation can be a problem in the integration 
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Figure 1. HDF [/-selected z ~ 2 galaxies shown as 4" X 4" images extracted from the 1024 X 1024 pixel F300W mosaic of the HDF 
archive. The pixel scale is about 6.25 pix/". North is to the upper left, east is to the lower left. Circles show areas excluded due to 
objects occupying large solid angles from which the z fts 2 galaxies would not be selected in an unbiased way. Concentric circles indicate 
that Williams et al. (1996) chose various options for uniting bright 'split' galaxies; in such cases, the largest of the concentric circles is 
the effective exclusion zone here. The boundaries used by Mobasher & Mazzci (1998) are shown for the three WFC fields, labelled #2 
(upper left), #3 (bottom left) and #4 (bottom right). One excluded area (circle) appears in the Planetary Camera field, which is not 
analysed. 



of Limber's equation (whether analytical or numerical). This 
is because although the volume at low redshifts is small, the 
correlation modelled by a power law at small separations 
becomes extremely high*. The resulting contribution to w 
can be non- negligible (see Fig. 1, Roukema & Valls-Gabaud). 

Although this could be considered to have some phys- 
ical meaning, e.g. to represent the clustering of very faint 
(17 ^ 27) dwarfs in the Local Group, the small angular size 
of the field (« 4.5xl0~ 7 sr) implies that any celestial pop- 
ulation of objects needs to have at least « 2.8 xlO 7 objects 
over 4-7T sr in order to expect (in the mean) a single object 
in the HDF. So a few small dwarf spheroidals distributed on 
the scale of the local supercluster could possibly be present 



* It was pointed out by Bruno Maillard (personal communica- 
tion) that since the observer lives in a non-random point of space, 
i.e. in a group and local super-cluster, the correct modelling of the 
correlation should be even higher towards zero pair separations 
at near zero redshifts. 



in the HDF, but the presence of a Local Group galaxy would 
only be due to a very rare event. 

In the present case, the interest is in cosmological dis- 
tances, so photometric redshift estimation excludes this very 
low redshift component — certainly in the calculation and 
most probably in the observational data. 



3.2.3 Galaxy Halo Sizes 

Given the exclusion of low redshifts (z <, 1.5), the minimum 
three-dimensional separation considered in the integral is 
not too different from the perpendicular separation at the 
median redshift of the redshift band 1.5 < z <, 2.5. The per- 
pendicular separations over these redshifts are considerably 
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Figure 2. HDF [/-selected z ~ 2 galaxies, as for Fig. 1 but without any masking of areas around large objects. 



smaller than 100-200h 1 kpc, i.e. than likely present-day 
sizes of typical L* galaxy haloes^ . 

Is it physically reasonable for galaxy pairs to exist as 
close to one another as 6h _1 kpc for long enough to have a 
chance of being observed in an HDF-like sample? 

Hierarchical galaxy formation models would imply that 
galaxy haloes should be smaller at z ~ 2 than at the present. 
However, at typical redshifts of Lyman-ev and metal line ab- 
sorbers in front of quasars, the haloes of galaxies are es- 
timated as having gaseous radii of around 50-200h _1 kpc 
(Bergeron & Boisse 1991; Bechtold et al. 1994; Lanzetta et 
al. 1995; Fang et al. 1996; Le Brun et al. 1996; Chen et al. 
1998). 

It is dynamically unlikely that the dark matter halo 
radii could be smaller than the gaseous radii, so the closest 
fit between theory and observation would be for dark matter 
halo radii to be about the same size as the gaseous radii. 

It could be possible that the z ~ 2 galaxies in the HDF 
are mostly dwarfs (defined by mass) , brightened by bursts of 
star formation, and that the types of galaxies corresponding 

t The word 'haloes' is used here in the galaxy formation sense 
of dark matter haloes containing baryonic and/or nonbaryonic 
non-luminous matter. 



to quasar absorbers are very rare in the HDF. In this case, 
galaxies could indeed coexist according to an extrapolation 
of £ to small pair separations. Alternatively, since the half- 
light radii of HDF galaxies are small, it could be possible 
that the baryonic, stellar cores of haloes, i.e. galaxies, arc 
tightly enough bound that they co-orbit in the haloes (i) 
without merging over a Hubble time and (ii) they do this is 
such a way that their pair separations remain statistically 
well modelled by £. 

However, it seems reasonable that a fair fraction of the 
HDF galaxies, which are detected in spite of (1 + z) 4 bolo- 
metric surface brightness dimming, have halo radii (gaseous 
and dark matter) in the 50-200h _1 kpc range. 

Once gaseous haloes overlap, it is likely that in some 
fraction of cases (e.g. low relative velocities), these haloes 
will merge in much less than a Hubble time. Among the halo 
pairs that merge, some fraction of their baryonic, 'galaxy' 
cores will also merge rapidly. This removes pairs from the 
pair distribution. So, the pair probabilities, hence £, should 
be lower than that expected from the low r extrapolation of 
£ from its cosmological context. 

It is interesting to note the argument of Bartlett & 
Blanchard (1996) that dynamical estimates (from the cos- 
mic virial 'theorem') are consistent with dark matter halo 
radii larger than conventionally assumed, i.e. bounded below 
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Figure 3. Angular correlation function w shown as a function 
of angle 8 for the three WFC fields, shown as crosses (chip #2), 
squares (chip #3) and triangles (chip #4), using cq. (24) of Hamil- 
ton (1993) to estimate w, where ri es ^ is the mean number density 
of the fields (cq. 6 here) and where regions occupied by large ob- 
jects are masked. Error bars on these points are Poissonian. The 
filled circles and their vertical error bars (in bold) show the mean 
values of w for the three chips considered as independent samples 
and the standard errors in the mean. Horizontal error bars show 
the bin sizes. A power law fit to the mean values is shown. See 
Table 1 for key numerical values in this and following figures. 



by 300h _1 kpc. However, there is no indication that matter 
extending out to these large radii would be baryonic, i.e. col- 
lisional, so rapid merging of pairs would be unlikely. In such 
a scenario, the expression 'halo radius' could be replaced by 
'halo impact parameter leading to rapid merging'. 

Many possible effects of pair exclusion are probably con- 
volved, but the observational constraints are weak. So, the 
correction adopted here is simply to include a cut-off in £ 
by a single parameter representing a 'typical halo radius', 
r halo- We consider how this affects the interpretation of the 
observed values of w. The cutoff is modelled smoothly as 




r halo 



-r/(l + z)] 2 



+ z) > r halo 
| , r/(l + z) < r halo 



(8) 



where r"^ a [ is in proper units, r/(l + z) is the proper separa- 
tion corresponding to r in comoving units, and a — Hialo/^- 
This is a Gaussian cutoff, so that the probability of a pair 
of galaxies existing at r/(l + z) = Talo/2 is multiplied by 
e -1 / 2 ?s 0.61, and that of a pair at r/(l + z) = Talo/10 i s 
multiplied by e ( " 9/5)2/2 w 0.20. 



4 RESULTS 

Galaxies in Mobasher & Mazzei's (1998) HDF catalogue 
which have redshifts estimated as 1.5 < z < 2.5 are shown 
in Fig. 1, for apparent magnitudes 23 < J/f300W — 27, over 
which the catalogue should be complete. Galaxies in areas 
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Figure 4. As for Fig. 3, but without any masking for regions 
around bright objects. 



0.8 
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Figure 5. As for Fig. 3, but without any merging of close objects. 



biased by large objects are excluded. The galaxy positions 
are those of Mobasher & Mazzei, the galaxy images are from 
the the HDF archival F300W image and Williams et al.'s 
(1996) catalogue is used for the positions of large objects. 
To be conservative, objects detected at less than 



"mask : 



(9) 



from the centre of a large object, where A — 10 sq. arcsec is 
the solid angle within the isophotal limit of the large object, 
are excluded. Williams et al.'s catalogue includes several es- 
timates of the isophotal areas for the brightest objects, de- 
pending on how many components of what seems to be a 
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Table 1. Estimates of amplitude and slope of w(z f» 2, 2" < 8 < 40") for different techniques. Columns show number of galaxies N g ; the 
assumption A for re-introducing large scale variance being either A= N, a constraint by n es ^, or A= 7, a constraint by assumption of a 
power law of slope 1 — 7; the value of 1 — 7 assumed for parameters present in later columns; effective angle 9\ — f (weakly dependent on 
assumed slope); w(9 pa 5") and its standard error in the mean a— estimated free from power law assumptions using cq. (24) of Hamilton 
(1993); an upper limit to (1 — 7)+ for a power law fit to the same derivation and its uncertainty; w(9 sa 5") and lcr uncertainties based 
on the estimator of Landy & Szalay (1993); w(9 fa 5") and lcr uncertainties based on eq. (15) of Hamilton (1993). The values of w are 
linear, not logarithmic. The first two lines are for masking of large objects, the second two without masking. 



N g A (1 - 7) 6»i_ 7 Hamilton (1993) eq. (24) LS (1993) Ham (1993) cq. (15) 

tot #2 #3 #4 w(6) a{w) (1-7)+ <r[(l- 7 )+] «i(0)i_ 7 <r[m(fl)i_ 7 ] w(fl)i- 7 ^Kgji-j 

Masking: 



142 55 35 52 


N 






0.10 


0.09 


-0.47 


0.56 






142 55 35 52 
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-0.7 


5.5 










0.20 


0.11 


142 55 35 52 
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-0.8 


5.3 


0.16 


0.10 


-0.96 


0.27 


0.19 


0.10 


142 55 35 52 
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-0.9 


5.1 










0.18 


0.10 


Without masking: 
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0.16 


0.07 


-0.51 


0.44 
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0.27 
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150 59 35 56 
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-0.8 


5.3 










0.26 


0.10 


Without integral constraint correction 


(C = 
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0.09 
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Without merging of close objects: 
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0.12 


0.09 


-0.66 


0.55 






153 61 35 57 
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-0.7 
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-0.8 
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Figure 6. As for Fig. 3, but varying n es ^ in order to best fit w to 
a power law with slope 1 — 7 = 0.8. The slope of the curve shown 
is 1 - 7 = 0.85. 



single object are counted as a single one. This is seen as- 
several concentric rings around a single bright object. The 
masked zone in such a case is defined by the largest ring. 

Fig. 2 shows a similar image, but without any masked 
regions. It is clear by comparison of the two images that 
several of the 'voids' in this figure correspond to the larger 
of the masked regions in Fig. 1. This suggests that galaxies 
have been missed in these regions. 




8 (in arcsec) 

Figure 7. As for Fig. 3, except the calculation of w is based 
on Landy & Szalay's (1993) estimator plus an integral constraint 
additive correction (eq. 5) for individual fields such that the cor- 
rected correlations are power laws of slope 1 — 7 = —0.8 (cf. 
Villumscn ct al. 1997; Connolly et al. 1998). The power laws re- 
quired for the corrections for the individual fields arc shown. The 
mean values arc calculated after the corrections. 

4.1 Estimates of w 

Fig. 3 shows the angular correlation function estimated 
using eq. (6) for the selected galaxies shown in Fig. 1. 
Large-scale variance has been introduced via an estimate of 
7f^j~ 'external' to the sample, as recommended by Hamilton 
(1993), without making any assumption about the shape or 
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Figure 8. As for Fig. 3, but using cq. (15) of Hamilton (1993) 
[eq. (2) here]. 



slope of w This is done by considering the three WFC fields 
to be independent samples, and n^j~ to be the global mean 
density for the three fields. 

Strictly speaking, large scale variance at scales larger 
than the combined WFC fields would still be missing from 
our estimate of nT^t . The standard deviation of the num- 
ber of objects per WFC field is 10.8 (with masking and with 
merging of close objects), and the mean is 47.3, i.e. the stan- 
dard error in the mean of the n es i estimate is about 13%. 

One way of potentially reintroducing the large scale 
variance is to follow the convention of fitting to a power 
law of a given shape and slope. This would only be correct 
if the true shape and slope are in fact those chosen. This is 
discussed further in §4.1.3. 

The three fields, considered as independent samples, are 
averaged and the point in the lowest angular bin (at 8 w 5") 
is considered to be the estimate of the amplitude of w. A 
fit to the three points in 2" < 6 < 39" is considered as an 
upper limit to the slope 1 — 7. 

Table 1 lists numerical values for this estimate and for 
other estimates which take into account possible systematic 
errors and conventional estimators for w. The median angle 
in the 6 ~ 5" bin is also listed. Since this depends on the 
slope 1 — 7 and is only slightly higher than 5", the bin can 
be referred to as the '5" bin' for brevity. 



4.1.1 The Effect of Masking 

Comparison of Fig. 3 and Fig. 4 shows that several 'voids' 
in the unmasked galaxy distribution shown in Fig. 2 corre- 
spond to masked areas shown in Fig. 1. This has a notice- 
able effect on w: the value increases by about 60% if the 
'voids' are considered to be unbiased regions. The masked 
and unmasked values of w(5") are 0.10±0.09 and 0.16±0.07 
respectively (Table 1). 

Is the difference statistically significant? Suppose we 



consider the masked value, 0.10, to be the true value, the 
results of repeated experiments to be normally distributed 
about this with a standard deviation of 0.09, and the un- 
masked value, 0.16, to be a realisation of the same exper- 
iment. Then the null hypothesis that the unmasked value 
comes from the same experiment can only be rejected at 
a P G auss[(l w ~ 0.10|/0.09)«T < (0.06/0.09)cr] = 50% con- 
fidence level. So it is not proven to high significance that 
objects have indeed been 'hidden' by the large objects. 

The difference that does exist can be understood arith- 
metically as follows. An uncorrelated distribution punctu- 
ated by voids would become correlated; an already corre- 
lated distribution generally becomes more so. So, the effect 
is strongest for the two bins smaller than the typical 'void' 
size, as expected. 

4.1.2 The Effect of Object Multiplicity 

The effect of object splitting is small. Fig. 5 shows w esti- 
mated as in Fig. 3, but by considering objects separated by 
less than 0.5" as genuinely independent galaxies. If a sin- 
gle galaxy is seen as, say, multiple H II regions, which are 
wrongly assumed to be independent galaxies, than the corre- 
lation function would in effect be weighted more strongly by 
(a) galaxies close to these 'multiple galaxies' and (b) pairs 
of such 'multiple galaxies'. 

Effect (a), on small scales, would be noticeable if the 
multiple galaxies are located in clustered regions, in which 
case they pair up with close galaxies. The multiplicity may 
be due to either the 'halo building blocks' or the 'H II region' 
interpretations. 

If the multiplicity is due to the 'halo building blocks' 
model, then the 'multiple galaxies' would be in clustered 
regions, since there is no sharp cut in the hierarchy of clus- 
tering. In this case, effect (a) should be expected. 

On the other hand, in the 'H II region' interpretation, 
due to (1 + z) 4 bolometric surface brightness dimming, H II 
regions are favoured over disks relative to shot noise. In this 
case, there is no reason to expect these galaxies to be in 
clustered regions more often than 'non-multiple galaxies' are 
in clustered regions, apart from the extent to which star 
formation requires close galaxy-galaxy interactions and/or 
merging. 

So the presence or absence of this effect would respec- 
tively favour the 'halo building block' or the 'H II region' 
interpretations of the sub-arcsecond pair excess noticed by 
Colley et al. (1996). 

Effect (b) would be stronger as long as at least two 'mul- 
tiple galaxies' are in the sample, since it would depend on 
the square of the r.m.s. multiplicity. If the multiple objects 
are relatively rare, the effect would occur at relatively large 
separations. The effect would be to increase the correlation 
at the separations of the multiple objects. If the multiple 
objects were common enough, then their contribution would 
reflect their intrinsic correlation as a population. 

In this case, the multiple objects are rare: 6, and 5 
objects are doubly counted in the three fields respectively if 
objects separated by less than 0.5" are considered as inde- 
pendent real galaxies (see Table 1). 

Hence, effect (b) is to be expected only at large scales 
and is separable from effect (a). 

Between Figs 3 and 5, the effect is small in the 5" bin 



© 0000 RAS, MNRAS 000, 000-000 



10 B. F. Roukema et al. 



and most obvious in the 21-39' bin. This favours the 'H IF 
region interpretation, though not significantly. 

4-1.3 Conventional Estimators 

Villumsen et al. (1997) and Connolly et al. (1998) do not 
use an estimate of the mean density n^j" in order to include 
large-scale variance. They adopt the conventional technique 
of first estimating w with the mean density of the sample, 
then searching for an additive correction which best trans- 
forms the set of binned {wi} values close to a power law of 
slope 1 — 7 = —0.8. 

In the case of eq. (6), the only means of modifying this 
to fit a power law is to recalculate w for different values 
of n^t until the closest resulting set {wi} to a power law 
of desired slope is obtained. The result obtained with the 
fixed 'external estimate' of n es t gives a slightly shallower 
upper limit slope, 1 — 7 = —0.47 ± 0.56, than the conven- 
tional 1 — 7 = —0.8. So, it could be expected that this more 
conventional constraint would give a slightly lower value for 
w(5"), in order to increase the steepness of the slope. 

However, this is not the case here; eq. (6) is not depen- 
dent on a simple additive parameter as is the case of eq. (5). 
Fig. 6 and the entry in Table 1 show that the closest slope 
to 1 — 7 = 0.8 attainable is 1 — 7 = —0.96, and that w(5") 
increases from 0.10 to 0.16. 

The formal significance of the difference obtained by 
using the power law constraint is (by coincidence) numeri- 
cally the same as that if masking is omitted. That is, if the 
fixed constraint determines the true value of to(5"), then the 
1 — 7 = —0.8 power law fit gives a value of w(5") rejected 
at only a P G auss[(l™ ~ 0.10|/0.09)ff < (0.06/0.09)ff] = 50% 
confidence level. 

This is not equivalent to using eq. (5) with the same 
power law, as can be seen from the equations, from Fig. 7 
and from Table 1. For the present data, eq. (5) overestimates 
w. 

Again, consider 0.10 ± 0.09 to represent the true value 
and true error distribution, and consider the estimate from 
eq. (5) to be an unbiased realisation of this distribution. 
Then this null hypothesis is only rejected at a fGauss[(l w — 
0.10|/0.09)<t < (0.09/0.09H = 68% confidence level. 

However, as mentioned above, not all large scale vari- 
ance is restored by using the mean of the three WFC fields 
to estimate n cs t. So, an alternative is to consider the power 
law corrected estimate using eq. (6) to be the true estimate, 
restoring the large scale variance. In that case, the (power 
law corrected) estimate from eq. (5) considered as a real- 
isation of the same experiment can be rejected only at a 
- p Gauss[(l™-°- 16 l/°- 1 0) cr < (0.03/0.10)0-] = 24% confidence 
level. 

This is not at all a significant difference, which is not 
surprising. The differences between the two formulae are in 
the linear error terms, so should be of the same order of 
magnitude as the uncertainties. 

An advantage of eq. (5) is that for a typical data set, 
uncorrected correlations [C = in eq. (5)] are close to zero 
at large angles. This gives a good chance of being able to 
find a best fit power law as close as possible to the desired 
slope, since C = gives a very steep slope, C 1 gives 
a very shallow slope, and the effect should be smooth and 
continuous. 



t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 



all 




1 1.5 2 2.5 



z 

Figure 9. The photometric redshift distribution of the 1.5 < z < 
2.5 [/-selected HDF galaxies for the individual WFC fields and in 
total (normalised). 

This is equally a disadvantage, since the desired slope 
is attained even though the equivalent calculation, avoiding 
linear terms, using eq. (6), shows that this should not be 
the case, apart from consideration of the uncertainties. In 
other words, the desired slope is attained through inclusion 
of linear error terms, which is obviously less than optimal. 

So that readers can compare with other works, the re- 
sults of using eq. (5) for various assumed slopes and includ- 
ing the effects of not masking, of setting C = and of not 
merging close objects are also listed in Table 1. 

For completeness, the application of Hamilton's eq. (15) 
[eq. (2) here], which avoids large scale variance altogether, is 
shown in Fig. 8. The missing variance is clear at the larger 
angles here. 

4.2 Interpretations in Terms of £ 

4-2.1 Redshift Dependence 

Although the purpose of this paper is less ambitious than 
the study of evolution of the correlation function, eq. (1), 
the redshift range under study is large enough that a simple 
power law model [in (1 + z)\ of spatial correlation evolution 
needs to be discussed. The following evolutionary version of 
eq. (1) is therefore adopted: 

CW = (r /r) 7 (l + ^)- (3+£ - 7) (10) 

(Groth & Peebles 1977), with r and ro in comoving coordi- 
nates as before, and e a factor representing evolution. 

As argued by Peacock (1997), a double power law may 
provide a more observationally and theoretically justified fit, 
or as shown by Hamilton et al. (1991), a rational polynomial 
fitting function based on dark matter only iV-body simula- 
tions can provide a theoretically justified model. 

Given the uncertainties in the HDF data, it seems pru- 
dent to simply adopt the power law model. Moreover, since 
the clustering is at a strongly non-linear scale, it should 
be expected that it is stable in proper units, in which case 
eq. (10) with e = 0.0 should be a fair approximation. This is 
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r Q (in h _1 Mpc) 

Figure 10. Angular correlation function amplitude uj(5") calcu- 
lated using eq. (7) and eq. (10) for a cutoff in the spatial cor- 
relation function £ at pair separations < »*o [ ec l- (10)], shown 
against ro in h _1 Mpc. The three sets of curves from top to 
bottom are for correlation evolution parameters e = —1.2,0.0 
and 0.8 respectively. Solid, dotted and dashed curves are for 
metric parameters (Qo = l,Ao = 0), (Qq = 0.1, Ao = 0) and 
(Qq = 0.1, Ao = 0.9) respectively. The photometric rcdshift dis- 
tribution of the 1.5 < z < 2.5 [/-selected HDF galaxies (Fig. 9) 
is used in the integrals of eq. (7). The HDF estimate of w(5") 
obtained using eq. (24) of Hamilton (1993) (Fig. 3) is reproduced 
here by the horizontal lines, mean in bold, error bar as thin lines. 
1 — 7 = —0.8 is adopted. 




r halo ( in h 1 k P c ) 

Figure 11. Angular correlation function amplitude w(5") cal- 
culated using eq. (7) and eq. (10) for a smooth cutoff in the 
spatial correlation function £ at pair separations ^S^ na i , in 
h _1 kpc, [eq. (8)] to take into account the non-zero size of dark 
matter halo radii. The present-day correlation length is fixed to 
ro = 5.5h _1 Mpc. Other parameters and line styles are as for 
Fig. 10. 

with low redshift estimates of ro ~ 5.5h _1 Mpc (e.g. Davis 
& Peebles 1983; Loveday et al. 1992), but the low density 
metrics are clearly consistent with proper length stable clus- 
tering. 



the possibility discussed primarily here, though other possi- 
bilities such as the inclusion of the effects of bias could, in 
principle, also be considered. 

Other possibilities for correlation function evolution in- 
clude e = 7 — 3 (stable clustering in comoving coordinates) , 
e = 0.8 (linear growth for f2o = 1, Ao = 0) and higher values 
for e in the transition zone between linear and non-linear 
regimes. 

The smoothed redshift distribution of Mobasher & 
Mazzei's catalogue is shown in Fig. 9. For the differences 
in the correlations between the three WFC fields to be at- 
tributed to redshift evolution, this would have to be in the 
sense of stronger correlations at higher redshifts (Ogawa et 
al. 1997). Noise within the error bars shown would be a more 
conservative interpretation. 

4-2.2 Zero Redshift Correlation Lengths ro 

The correlations at redshift zero for proper length stable 
clustering (and for two other values of e) are shown in Fig. 10 
for 1 — 7 = —0.8. As is well known (Yoshii, Peterson & 
Takahara 1993), lower values of w are expected for low den- 
sity and for low density A-dominated metrics than for an 
Qo = 1, Ao = universe. 

The w(5") estimate for this 'no-evolution' model is 
equivalent to ro = 2.6~^\'\'h~ 1 Mpc (f2o = l,Ao = 0), ro = 
3.912'^h- 1 Mpc (fi = 0.1, Ao = 0) or r = BStlgh.- 1 Mpc 
(fi =0.1,A = 0.9). 

The high density metric implies about 2a inconsistency 



4.2.3 Halo Radius Cutoff 

If the non-zero size of halo radii is taken into account, using 
eq. (8) and ro = 5.5h _1 Mpc the measured value of w(5") 
implies halo characteristic radii which are very reasonable. 
This can be seen in Fig. 11 

For clustering stable in proper units and this fixed value 
of ro, halo radii are rjj a i = 70t3 !0 h~ 1 kpc (Qq — 1, Ao = 0), 
r halo < 350h _1 kpc (la upper limit; Qo = 0.1, Ao = 0) and 
r halo < 210h~ kpc (la upper limit; Ho = 0.1, Ao = 0.9). 
Because no pairs of galaxies can be seen at less than the 
separation perpendicular to the line of sight, r na j has no 
effect below a certain value, so for low density metrics only 
upper bounds are found. 

These numbers are totally consistent with estimates 
from quasar absorption systems, if these are samples of a 
similar population of objects, of which some fraction merge 
quickly once the gaseous haloes overlap. 

If the w(5") value estimated from the HDF had been 
much lower, say, a factor of ten lower, then the halo radii 
would have been considerably higher. By extrapolation, 
they would have been r na j = 680^3™° h _1 kpc, r na j Q — 
SOOl^h -1 kpc and r halo = 300l^J°h _1 kpc respectively 
for the three metrics, suggesting larger galaxy exclusion radii 
than the estimated sizes of the absorption systems. 

4.2.4 Angular Dependence of w (6) 

The angular dependence of w might be thought to have a 
low 6 cutoff in the presence of a non-zero value of r na i Q . 
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6 (in arcsec) 



Figure 12. Angular correlation functions with and without a 
halo cutoff of ?* na 2 = 100h — 1 kpc, for clustering stable in proper 
units. Different line styles are for different metric parameters as 
in Fig. 11. Thick lines are calculated with the halo cutoff, thin 
lines without. The circles are simply 1— 7 power laws extrapolated 
from the right-hand side of the plot. Our HDF estimates obtained 
using cq. (6) are also shown. 
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Figure 13. As for Fig. 12, but for clustering stable in comoving 
coordinates (e = 7 — 3) and for r na i = 200h~ 1 kpc. 

Fig. 12 shows that this is not the case. A simple round 
number of rfo a i = lOOh- 1 kpc was chosen for illustration, 
for the three choices of metric and no clustering evolution 
in proper units. This cutoff clearly has an effect. However, 
because w is a projection of £, pairs of galaxies separated 
by r/(l + z) > rjjaio and whose separation vectors are ori- 



ented towards the line-of-sight are seen with perpendicular 
separations less than r ria [ G , so there is still a strong signal. 

This figure also shows that the high density metric 
model provides a less good fit than expected solely from 
considering w(5") as an amplitude estimate. An ri la [ G = 
100h _1 kpc halo size brings it into nearly la consistency 
with the w(5") estimate, but it is inconsistent with the 
12-21" bin. 

Of the three metrics used for the figure, the low density 
metrics provide the best fits. Since perpendicular distances 
for a given redshift and angle are smaller for these metrics 
than for a high density one, the effect of rjialo becomes neg- 
ligible. 

Alternative ways to best fit all points would be to in- 
crease e to a value higher than that for stable clustering in 
proper units, i.e. e > (e.g. Efstathiou et al. 1991; Brainerd 
et al. 1995), or to consider a bias factor (e.g. Ostriker 1993). 

However, the uncertainties shown in the amplitudes of 
w{9) in the figure are only statistical uncertainties. The dif- 
ferences in Figs 3-8 suggest that systematic uncertainties in 
the second and third bins could be too large for detailed 
comparison of different metrics or models of bias. 

How strong could the effect of rj la [ G could be? For 
stronger evolution with redshift than e = 0, the expected 
values of w would be lower than in Fig. 12, so non-zero 
r halo would either have a statistically insignificant effect or 
would imply lower values than observed. 

A lower limit to evolution is that of clustering which is 
constant in comoving coordinates. Fig. 13 shows this case, 
for a round value of r^ a i = 200h~ 1 kpc. This is sufficient 
to bring a high density metric model to within ~ 3a of the 
w(5") value, but it remains highly inconsistent with w at 
all larger angles. The A-dominated metric model is brought 
within the la limit of the w(5") value, but still remains 
inconsistent at about 2a with the 12-21" bin. 



5 DISCUSSION 

How do these results compare with the other HDF analyses? 

5.1 Comparison with Villumsen et al. (1997) 

Table 1 of Villumsen et al. (1997) shows that their estimate 
of w which could be most closely compared to the present 
one would be for their R < 29 magnitude limited sample, 
which is modelled to have a median redshift of z mc d~ 1-9, 
similar to that of our {/-band plus photometric redshift se- 
lected sample. Since the redshift distribution of Villumsen 
ct al.'s sample is much wider, a lower amplitude should be 
expected. 

Their amplitude is equivalent to w(5.3") = 0.017T0.009 
(using the effective angle over our '5" bin' for 1 — 7 = 0.8). 
Because of the way that angular diameter changes with red- 
shift, the three-dimensional separations probed by Villum- 
sen et al.'s sample should not be very different from that 
of our sample. So, these separations should be mostly in 
the strongly non-linear regime, where no evolution in proper 
units is expected (e = 0). 

Therefore, most of the difference in amplitude should 
come from the effect of the superimposition of many 'inde- 
pendent' slices, each intrinsically auto-correlated (£», = £) 
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but having zero correlation with other slices = 0). An or- 
der of magnitude estimate of this effect would be to take the 
half-maximum points of the redshift distribution in Fig. 9 
as the limits of dN/dz used here, i.e. 1.6 <,« <,2.2, to take 
z med/2 an d 3z me( j/2 as the limits of Villumsen et al.'s dis- 
tribution for 7? < 29, and count the number of 'slices' of 
our sample that would fit in theirs (using comoving distance 
coordinates). The number of our 'slices' which would fit in 
theirs are N = 3.4-3.2 (depending on the metric parameters 
in the order listed above). Their correlation amplitude can 
then be summed over pairs of slices ij as 



(11) 



times our value, i.e. around 1/3.3 times ours. 

In fact, their w value is about 1/7 of ours, so would 
be equivalent to roughly half of ours if restricted to a single 
'slice'. For 7 = 1.8, this would imply that their inferred value 
of ro (for a given set of metric parameters and no proper co- 
ordinate evolution) would be ~ 2 -1 ' 8 « 0.3 times our value. 
For fio = 1, Ao = 0, our estimate of ro = 2.6l 1 '7h _1 Mpc 
would become r = 0.75jl5i;;jh _1 Mpc. 

Villumsen et al.'s Fig. 3 (left panel) suggests that their 
modelling via Limber's equation would imply about r ~ 
1.8 ± 0.5h _1 Mpc for Q = 1, A , e = 0, so this rough esti- 
mate from eq. (11) underestimates the integrated value by 
less than half an order of magnitude. 

The ratio between Villumsen et al.'s ro value and ours 
is about a factor of 0.7. Although this disagreement is only 
significant at ~ la, and the two data sets are selected quite 
differently, both data sets do consist of HDF galaxies, so it 
is worth commenting on possibilities for a systematic rather 
than random error. 

Villumsen et al. have 1559 galaxies; we have 142. Vil- 
lumsen et al. clearly have an advantage in numbers which 
should reduce Poisson error. 

They also have the property of selecting galaxies in rest- 
frame wavelengths much closer to those of major low redshift 
galaxy surveys than of our [/-selected sample. This would 
be an advantage if z ~ 2 galaxies were statistically similar 
to low redshift galaxies, but less so if a large proportion of 
z ~ 2 galaxies are either starbursting galaxies or ellipticals 
with UV upturns. 

Villumsen et al. also have the disadvantage of combining 
galaxies of widely differing redshifts, which should increase 
noise, and of not knowing the precise shape of the redshift 
distribution. This could very easily provide a systematic er- 
ror of a factor of 0.7 in ro, e.g. see Fig. 2 [panel (a)] and 
Fig. 3 of Roukema & Valls-Gabaud (1997). 

Other possible systematic uncertainties are the inclu- 
sion of linear uncertainty terms and the correction for re- 
gions obscured by large objects. Villumsen et al. used eq. (5) 
rather than eq. (6) and adopted the power law of slope 
1 — 7 = 0.8 constraint. They did not refer to masking for 
bright objects. 



5.2 Comparison with 'Low' Redshift Samples 

Connolly et al.'s (1998) w estimates are based on photo- 
metric redshifts of 926 ?7f814W < 27 galaxies in the HDF 
and 0.4 < z < 1.6. For e = 0, over the range 0.2 < Q a < 



1, Ao = 0, the authors infer ro * 2.8 ± 0.3 [1<7 error, from 
their Fig. 3(a)]. 

This is close to our best estimate for Qo = 1, Ao = 0, but 
lower by 0.4<r than our result for fio = 0.1, Ao = (§4.2.2, 
using our uncertainty). So even without knowing what value 
of fio corresponds to their best estimate for ro, their result is 
consistent with ours. The caveats regarding the use of eq. (5) 
and not correcting for regions biased by large objects also 
apply here. 

Other correlation function estimates based on either 
spectroscopic or photometric redshifts include those of 
Le Fevre et al. (1996) (spectroscopic, z me( ^— 0.53) and Gi- 
avalisco et al. (1998) (spectroscopic, z me( ^= 3.0) and Mi- 
ralles et al. (1998) (photometric, z me( [— 3.7). 

The first two of these estimates are for fields of angles 
about a factor of ten larger than that of an WFC field, so 
cover the transition scale between the quasi-linear and non- 
linear regimes of galaxy clustering and may not express clus- 
tering fixed in proper units. Nevertheless, we discuss these 
briefly. 

Le Fevre et al.'s z me( }= 0.53 result for the present-day 
value of r and e = (§4.1-1. of Le Fevre et al. 1996) is 
r = 3.0 ± O^h- 1 Mpc (fi = 1,A = 0) or r = 3.9 ± 
0.2h _1 Mpc (fi = 0.2, A = 0). This is consistent with our 
result. However, this is likely to be a coincidence. The effect 
of a non-zero size of r^ a i which can decrease the inferred ro 
on the HDF scale studied by Villumsen et al., Connolly et 
al. and ourselves, is replaced in the case of Le Fevre et al., 
by clustering growth stronger than e = which decreases 
the values of ro at low redshifts (cf. §5.3). 

It should be kept in mind that the galaxies in our sample 
(Mobasher & Mazzei 1998) are selected at rest wavelengths 
of around 1000A, so comparison with correlation functions 
estimated for low redshift galaxies is difficult. Mobasher & 
Mazzei find that the galaxies in the sample are mostly ellip- 
ticals and starburst galaxies. 

At low redshifts, the former have slightly higher corre- 
lations than the general galaxy population. 

If the latter are a random selection of young disk galax- 
ies which have just collapsed and started a rapid burst of 
star formation, then the corresponding correlation implied 
from low redshifts should be slightly lower than that of the 
total population. On the other hand, if only those young disk 
galaxies which happen to be close to one another have star 
bursts in order to be selected among our HDF 'UV drop-in' 
sample, then the correlation could well be higher. 

Modelling the combination of the two populations 
would also require knowledge of the cross-correlation be- 
tween them. 

So, the overall population mix observed at 2 ~ 2 is 
likely to represent a complex mix of the populations at low 
redshifts. The different effects might cancel each other out, 
or cause a significant systematic effect. 

5.3 The Decreasing Correlation Period 

Giavalisco et al.'s result at z mc d— 3.0 for Lyman break 
galaxies (LBG's) is for a large value of ro. Expressed as 
values for clustering fixed in proper units [i.e. multiply- 
ing by (l+£ mcd ) llM7 ~ 3)]/7 ] this is r = 5.3lJ' (1 h- 1 Mpc 
(fi = 1, A = 0) or r = 8.4±^ jh" 1 Mpc (fi = 0.2, A = 0). 
This is about 2a larger than our estimate. 
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As these authors discuss, this seems to be a detection 
of the decreasing correlation period (DCP; Roukema 1993; 
Brainerd & Villumsen 1994; Ogawa et al. 1997; Bagla 1998; 
Steidel et al. 1998; Moscardini et al. 1998; Coles et al. 1998). 
The DCP is the period of large galaxy formation when the 
transition from linear perturbations to non-linear collapsed 
haloes may have caused a high bias factor in the initial cor- 
relation function of these haloes, which later disappeared as 
perturbations in underdense regions also collapsed. 

Since most of the signal in w of Giavalisco et al. is for 
9 > 20", this is corresponds mostly to larger length scales 
than our measurements. Their point which appears to be for 
a 0-20" bin (Fig. 2 their paper) is lower than their power 
law fit, so they may not have a detection of the DCP at this 
scale. 

Nevertheless, the DCP is clearly absent at the small 
scales measured in this paper from the HDF. Following 
Ogawa et al. (1997), a simple power law extension of eq. (10) 
is 



£(r,z) = 





" (i+*) " 


! 


(l+zt) 




ro/rH 



£{r, zt), 
l + z)-< 3+e - 



7) 



2 > Z t 

Z t > Z > 



(12) 



where correlation evolution and parameters for low redshift 
are as for eq. (10), but a transition redshift z t and DCP 
slope v parametrise the evolution during the DCP. 

If the present study does not miss the DCP due to the 
smallness of the length scale studied, then our result com- 
bined with that of Giavalisco et al. would imply that Zt lies 
somewhere in the range 2 < z t < 3. 

This would imply that the transition from the stage of 
the first collapse of large haloes in high density regions to 
the stage when most haloes of the same mass scale have col- 
lapsed finishes just before the time of major star formation 
(cf. Madau et al. 1996). Corrections to Madau et al.'s plot 
for dust (e.g. Mobasher & Mazzei 1998; Hughes et al. 1998) 
would imply an earlier and less sharply peaked maximum in 
the volume-averaged star formation history, so the overlap 
between the two epochs would seem to be even stronger. 
That is, the major star formation period would start just 
when most haloes of large mass have collapsed. 

Finally, we consider the analysis of Miralles et al. (1998) 
for HDF galaxies at 2 mc( j= 3.73. The measured correla- 
tion function for these galaxies is less easy to interpret than 
that of the present paper. The most significant correlation 
is in the bin centred at 6 ~ 30". This could either be (a) a 
clustered region occurring by chance at this particular red- 
shift, or (b) a detection of the DCP to a slightly higher 
redshift than that of Giavalisco et al.. Expressed for an 
Q = 1,A = universe and e = 0, the correlation length 
is r = 7.1 ± 1.511- 1 Mpc (Miralles et al. 1998). 

That is, the amplitude is higher than that of Giavalisco 
et al. (1998), expressed in terms of the same parameters. 
Since it is at a higher redshift this is just what would be 
expected from the DCP. 

If we take the two estimates together as estimates of 
the initially highly biased correlation above that expected 
for clustering fixed in proper coordinates, adopt la uncer- 
tainties of <5(2 mec j) = 0.1 for the two data sets, and use 
1 — 7 = —0.8, then the value of v in eq. (12) is v = 2.1 ± 3.6. 
Moreover, if we consider our result and that of Connolly 
et al. (1998) to be the 'stable' non-linear correlation length 



achieved at the end of the DCP, ro ~ 2.6±1.4, then the tran- 
sition redshift marking the end of the DCP is z t = 1.7 ±0.9. 

This is clearly consistent (within the error bars) with 
the lack of detection of the DCP in the present work. That 
is, the extrapolation of a simple power law evolution through 
the estimates of Miralles et al. (1998) and of Giavalisco et al. 
(1998) implies that the DCP finishes just slightly below z — 
2, i.e. has mostly disappeared by the epoch of our estimate. 

In addition, these values happen to lie well within the 
constraints derived from diverse TV-body models (§3, §6, 
Ogawa et al. 1997). 

However, since the Giavalisco et al. result is for larger 
scales, it would be more consistent to compare the two at 
scales for which stable clustering in proper units may not be 
valid, in which case the analysis would be a lot less trivial 
than the simple estimate made here. 

5.4 Halo Radii 

Our value of ro = 2.6t!'.7h~ 1 Mpc is lower than typical low 
redshift values of r ~ 5h _1 Mpc. As shown in §4.2.3 and 
§4.2.4, for ro = 5.5h _1 Mpc and e = 0, the correction for 
halo pair exclusion implies very reasonable values of r^aio 
to match the measured value of w(5"). 

Matching the 12-21" bin in addition (if ignoring sys- 
tematic uncertainties), would require a low density metric, 
a higher value of e or a moderate anti-bias, in which case 
the correction for non-zero 'halo radii' is likely to be consid- 
erably smaller for the present data set. 

Could the effect of r^ a j G be detected in other data sets 
? Both Fig. 5 of Miralles et al. (1998) and Fig. 3 of Postman 
et al. (1998) show decreases in slope below around 30" and 
60" respectively, which are qualitatively similar to those in 
Fig. 12 and Fig. 13. 

In Fig. 12, the effect of rjj a i = 100h _1 kpc could be ex- 
pected to occur below angles corresponding to perpendicular 
(proper) distances of 100h _1 kpc, i.e. 14-24" for the three 
sets of metric parameters. It is clear that the effect, inte- 
grated precisely via Limber's equation, occurs just slightly 
below these angles. 

In that case we can estimate the values of r^ a i below 
which values of w falling below a projected 1 — 7 = —0.8 
power law could be explainable simply by our halo cutoff 
formula. The angles in Fig. 5 of Miralles et al. (1998) and 
Fig. 3 of Postman et al. (1998) at which these falloffs oc- 
cur (in the latter just the two fainter slices are considered) 
would imply values of r^ a i just slightly larger than 100- 
210h~ 1 kpc and 250-360h~ 1 kpc respectively. 

The former is reasonable. The latter is rather large, and 
for 'halo exclusion' to apply, would require an explanation of 
why galaxies rarely occur close to each other (relative to the 
expectation from a power law £) at z ~ 0.8, but are able to 
occur this closely for long enough periods at lower redshifts, 
such that the effect disappears in Postman et al.'s analyses 
for brighter magnitudes. 

Low redshift estimates of £ at short separations would 
be good for comparison, but few exist. Davis & Pee- 
bles (1983) analysis is consistent with eq. (1) on scales 
10h~ kpc^Sr^lOh -1 Mpc, but the correlations are quite 
noisy at the small scales of interest. A more recent estimate 
is that of Tucker et al. (1997), who finds a similar result 
down to about 20h~ 1 kpc, but for the redshift-space corre- 
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lation function rather than the 'real' (i.e. spatial) correlation 
function, so this also is not easy to interpret. 



6 CONCLUSIONS 

We have estimated the amplitude of the angular two-point 
galaxy auto-correlation function w(6) for galaxies at z ~ 2 
from a a 17 < 27 complete sub-sample of the HDF, and 
find a result compatible, though slightly higher, than that 
of Villumsen et al. (1997), although the two samples have 
similar estimated median redshifts. The use of photometric 
redshifts, the avoidance of linear error terms in the estimator 
used for calculating the correlation function and the masking 
of regions biased by large objects favour our result as the 
more accurate of the two estimates; but the smallness of the 
numbers of galaxies in our sample favours that of Villumsen 
et al. as the more precise. 

The consistency between the two samples suggests that 
the high star formation rate and domination by starburst 
galaxies and ellipticals in our [/-band selected sample com- 
pensates for any effects due to the difference in rest-frame 
wave-bands between our sample and other samples. Since 
[/-band selection also has the advantage of a good high red- 
shift cut-off and helps to estimate photometric redshifts, this 
seems a useful strategy to complement selection in other 
wavebands. The technique could be referred to as the 'UV 
drop-in' technique (cf. Steidel et al. 1996). 

(i) Use of eq. (6), i.e. eq. (24) of Hamilton (1993), is 
illustrated in observational data, possibly for the first time. 
It is shown how this compares to the use of eq. (5). Both 
equations are similar to that of Landy & Szalay 1993, but 
eq. (6) corrects for uncertainty in the mean number density, 
n^t, without re-introducing linear error terms. 

The estimate using eq. (6) requires an external estimate 
of n es i, which we estimate from the mean of the three WFC 
fields, considering the individual fields as independent ex- 
periments (but with identical selection criteria). This gives 
w(6 ~ 5") = 0.10 ± 0.09. Alternatively, adoption of the 
conventional constraint that to should be a power law of a 
given slope, for 1 — 7 = —0.8, increases this by 60%, but the 
desired slope cannot be exactly obtained. Given the former 
estimate and its uncertainty, the possibility that the latter 
is identical to the former is only rejected at a 50% confi- 
dence level. Since the variance is related to the uncertainty, 
it is unsurprising that the difference is not formally of high 
significance. 

Eq. (5), which requires an assumption about the shape 
and slope of to, and which is normally sure of obtaining this 
slope, implies values about twice as high as these, depending 
on what value of 1 — 7 is assumed. This can be rejected 
at the 68% confidence level given a true value and error 
distribution as estimated using eq. (6) and n^f from the 
mean of the three fields. However, relative to the estimate 
from constraining eq. (6) to a power law of the same slope, 
this is only rejected at the 24% confidence level. Again, the 
differences are related to the uncertainty terms, so are of a 
similar order of magnitude. 

(ii) Biases introduced in faint galaxy selection by large 
objects, in this case by what appears to be the 'hiding' of 
galaxies by Poisson noise, creating 'voids', increases the esti- 
mate cited by 60% if it is not corrected. The null hypothesis 



of not masking introducing no change is rejected only at the 
50% confidence level. 

The difference in estimates of to with and without merg- 
ing together of objects closer to one another than 0.5" 
is negligible at 5", though is noticeable at larger angles. 
This favours (marginally) the 'H II region' interpretation of 
sub-arcsecond HDF galaxy pairing relative to the 'building 
blocks' interpretation (see Colley et al. 1996, 1997). 

(iii) The scales effectively studied here are in the range 
?s 25h _1 kpc— 250h _1 kpc, so overlapping of gaseous and/or 
dark matter haloes should exclude some fraction of galaxy 
pairs. A simple formalism for correcting £ by a smooth 
(Gaussian) cutoff [eq. (8)] is presented. For clustering stable 
in proper units [e = in eq. (10)] in an Q — 1, A = universe, 
our to (5") estimate (a) implies a present-day correlation 
length of r ~ 2.6±i;J,h- 1 Mpc if halo sizes are ignored, but 
(b) for a present-day correlation length of r = 5.5h _1 Mpc 
implies that a typical halo radius is rj la [ = 70l3Q°h _1 kpc. 

This value of r^^o ' s comfortably close to what could 
be expected, although this correction was not devised to fit 
the present data. It was pointed out by Roukema & Valls- 
Gabaud (1997) as simply being a correction likely to be 
needed just because halo and galaxy sizes are nonzero. 

However, comparison of w(0) as integrated using Lim- 
ber's equation with that estimated for the z ~ 2 galax- 
ies shows that this correction is insufficient to bring an 
Q = 1,A = universe into agreement with the data for 
clustering stable in proper units. Indeed, the second bin has 
a more significant estimate of w than the first, if only statis- 
tical uncertainties are considered: to(« 16") = 0.063±0.018. 
However, comparison of Figs 3-8 shows that systematic un- 
certainties are probably a few times larger than the statisti- 
cal uncertainties for this (and larger) bins. 

In the case of a cosmological constant dominated uni- 
verse (Qo = 0.1, Ao = 0.9), rjjjjQ has little effect, and as is 
already well known for previous estimates of w (e.g. Yoshii 
et al. 1993), a good fit over all angles results for low density 
metrics and e = 0, with or without a cosmological constant 
to flatten the metric. Higher values of e or anti-bias should 
have similar effects. However, the systematic uncertainties 
in the larger angular bins imply that a detailed fit across all 
angles may not be justified. 

It should also be noted that the lack of an internal 
correction for sample variance is a common feature of all 
the HDF angular correlation function estimates, because of 
the HDF's small size. This introduces uncertainty in all of 
these estimates. For the lowest redshift samples, a power law 
constraint based on large solid angle, faint apparent magni- 
tude/surface brightness limited samples could provide an 
observationally justified way to reduce this error. For the 
objects at successively higher redshifts, surveys using UV 
drop-in and UV drop-out techniques over large solid angles 
could provide observational estimates of the large scale vari- 
ance. 

(iv) The results of Giavalisco et al. (1998) and Mi- 
ralles et al. (1998) can be expressed, for an f2 = 1, A = 
universe and e = 0, as zero-redshift correlation lengths of 
r = 5.3ti;3h -1 Mpc and r = 7.1 ± 1.5h _1 Mpc respec- 
tively. These are both strongly suggestive of a decreasing 
correlation period (DCP; Roukema 1993; Brainerd & Vil- 
lumsen 1994; Ogawa et al. 1997; Bagla 1998; Steidel et al. 
1998; Moscardini et al. 1998; Coles et al. 1998), during which 
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the first haloes to collapse do so in high density regions, so 
are highly biased relative to the underlying density perturba- 
tions. The period terminates when most haloes of the same 
mass scale, in both high and low density regions, have col- 
lapsed, so that the correlation function behaves 'normally', 
i.e. at small scales is stable in proper coordinates. 

The decreasing correlation period (DCP) of a high ini- 
tial bias in the spatial correlation function is not detected in 
our data, so would have to terminate in the range 2 < z t £ 3. 

A simple fit to the two values just cited for Giavalisco 
et al. and Miralles et al. would imply values of z t = 1.7 ±0.9 
and v — 2.1 ±3.6 in Ogawa et al.'s extension of the standard 
power law fit [eq. (12)]. That is, values of £ at redshifts 
greater than z t = 1.7 ±0.9 would be [(1 + z)/(l ± zt)] 2 ' 1±3 ' 6 
times their values at z t , for a fixed value of r in comoving 
units. 

These values are consistent with those estimated from 
./V-body simulations (Ogawa et al. 1997) and with our lack 
of detection of the DCP. 

However, since the three studies are optimised to differ- 
ent scales, and since Miralles et al.'s result might be that of 
an individual structure which is not statistically representa- 
tive, this parametrisation of the DCP should be taken with 
caution. 
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